A New Algorithm to Optimize Maximal Information Coefficient
نویسندگان
چکیده
The maximal information coefficient (MIC) captures dependences between paired variables, including both functional and non-functional relationships. In this paper, we develop a new method, ChiMIC, to calculate the MIC values. The ChiMIC algorithm uses the chi-square test to terminate grid optimization and then removes the restriction of maximal grid size limitation of original ApproxMaxMI algorithm. Computational experiments show that ChiMIC algorithm can maintain same MIC values for noiseless functional relationships, but gives much smaller MIC values for independent variables. For noise functional relationship, the ChiMIC algorithm can reach the optimal partition much faster. Furthermore, the MCN values based on MIC calculated by ChiMIC can capture the complexity of functional relationships in a better way, and the statistical powers of MIC calculated by ChiMIC are higher than those calculated by ApproxMaxMI. Moreover, the computational costs of ChiMIC are much less than those of ApproxMaxMI. We apply the MIC values tofeature selection and obtain better classification accuracy using features selected by the MIC values from ChiMIC.
منابع مشابه
Application of Single Objective Genetic Algorithm to Optimize Heat Transfer Enhancement from a Flat Plate
The optimal shape of a two dimensional turbulator above an isothermal flat plate is found by using numerical simulation. The turbulent boundary layer over the flat plate was disrupted at various situations by inserting a quadrilateral bar where the boundary layer thickness kept more than three times greater than the insert\'s height. As a result, the overall heat transfer coefficient of the wal...
متن کاملImproved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring
In data mining, clustering is one of the important issues for separation and classification with groups like unsupervised data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as Genetic, PSO algorithm, Artificial bee colony algorithm, Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...
متن کاملA New Hybrid Algorithm to Optimize Stochastic-fuzzy Capacitated Multi-Facility Location-allocation Problem
Facility location-allocation models are used in a widespread variety of applications to determine the number of required facility along with the relevant allocation process. In this paper, a new mathematical model for the capacitated multi-facility location-allocation problem with probabilistic customer's locations and fuzzy customer’s demands under the Hurwicz criterion is proposed. Thi...
متن کاملAnalyzing Large Biological Datasets with an Improved Algorithm for MIC
The computational framework used the traditional similarity measures to find out the significant relationships in biological annotations. But its prerequisites that the biological annotations do not cooccur with each other is particular. To overcome it, in this paper a new method Improved Algorithm for Maximal Information Coefficient (IAMIC) is suggested to discover the hidden regularities betw...
متن کاملApplication of Evolutionary Algorithm to Optimization of ANNIS Model for Discharge Coefficient Circular Side Spillway Modeling
In this study, the discharge coefficient of the circular side orifices was predicted using a new hybrid method. Combinations made in this study were divided into two sections: 1) the combination of two algorithms including Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) and providing the PSOGA algorithm 2) using the PSOGA algorithm in order to optimize the Adaptive Neuro Fuzzy Infe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 11 شماره
صفحات -
تاریخ انتشار 2016